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Abstract. We prove the existence of solutions to Monge's transport problem 
between two compactly supported Borel probability measures in the Heisen- 
berg group equipped with its Carnot-Caratheodory distance assuming that the 
initial measure is absolutely continuous with respect to the Haar measure of 
the group. 



The classical Monge's transport problem refers to the problem of moving one 
distribution of mass onto another as efficiently as possible, where the efficiency 
criterion is expressed in terms of the average distance transported. It originates 
from a paper by G. Monge, Memoire sur la theorie des deblais et des remblais, 
in 1781. Rephrased and generalized in modern mathematical terms, we are given 
two Borel probability measures fj, and i/ona metric space (X, d) and we want to 
minimize 



among all transport maps T from /i to v 1 i.e., all /i-measurable maps T : X — > X 
such that T#fi = v, meaning that v{B) = ^,{T^ 1 (B)) for all Borel set B. 

In this paper, we are interested in Monge's transport problem in the Heisen- 
berg group (H™, d) equipped with its Carnot-Caratheodory distance. We prove the 
existence of an optimal transport map between two compactly supported Borel 
probability measures [i and v on HP assuming that the first measure [i is absolutely 
continuous with respect to the Haar measure £ 2 ™+ 1 of H™. 

Theorem 1.1. Let /i and v be two compactly supported Borel probability measures 
on HP 1 . Assume that fi <C C 2n+1 . Then there exists an optimal transport map 
solution to Monge's transport problem between \i and v, i.e., a ^.-measurable map 
T : H™ — > H™ such that T # ^i = v and 



Monge's transport problem in W 1 equipped with a distance induced by a norm 
has already been widely investigated. A first attempt to solve this problem goes 
back to the work of Sudakov [23]. It was however discovered some years later that 
the proof in [23] was not completely correct. In [13] a PDE-based alternative to Su- 
dakov's approach has been developed. The authors prove the existence of an optimal 
transport map in R™ equipped with the Euclidean norm under the assumptions that 
spt/tt n spt v = 0, /x, v <C C n with Lipschitz densities with compact support. Exis- 
tence results for general absolutely continuous measures /i, v with compact support 
have been obtained independently in 10 and [24] and have been extended to a Rie- 
mannian setting in [14] . The existence of a solution to Monge's transport problem 
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1. Introduction 




/ d(x,T(x)) dfi(x) = inf / d(x, S(x)) dfi(x) 
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assuming only that the initial measure [L is absolutely continuous has been proved 
in PQ, see also [2], [3J, [5]- All these later proofs roughly involve a Sudakov-type 
dimension reduction argument, via different technical implementations though, and 
require some regularity assumptions about the norm R™ is endowed with. For some 
time, it seemed that there were indeed some borderline cases about the norms that 
could not be attacked through these techniques. 

Recently another approach that does not go through Sudakov-type arguments 
and in particular does not require disintegration of measures has been developed in 
|12) . see also [11], to solve Monge's transport problem for general norms in W 1 . This 
approach relies on rather simple but powerful density results. In the present paper 
we follow closely this approach. We basically show that a very similar strategy can 
be implemented in the context of the Heisenberg group equipped with its Carnot- 
Caratheodory distance. The main features that play a role in this approach are 
that (HI™, d, C 2n+1 ) is a doubling polish metric measure space, a non-branching 
geodesic space and satisfies a so-called Measure Contraction Property. It is very 
likely that this approach can be extended to more general metric measure spaces, see 
Section |U We have chosen however to present the particular case of the Heisenberg 
group for simplicity, this space being moreover an instructive explicit example of 
non Riemannian space. Q 

The strategy starts by considering the nowadays classical relaxation of Monge's 
transport problem proposed by Kantorovich. In Kantorovich' formulation one con- 
siders transport plans, i.e., Borel probability measures on X x X with first and 
second marginals fi and v respectively. Denoting by II(/i, v) the class of all trans- 
port plans, one wants to minimize 



among all 7 € II(/i, v). Due to the linearity of the constraint 7 S II(/x, v), weak 
topologies provide existence of optimal transport plans. As a classical fact it turns 
out that whenever an optimal transport plan is induced by a /i-measurable map T, 
i.e., can be written in the form (/ <g> T)jj/x where (I (g> T){x) := (x, T(x)), then T is 
an optimal transport map between \i and v solution to Monge's transport problem. 
We follow here this scheme seeking after optimal transport plans that will be shown 
to be eventually induced by /i-measurable maps. 

In our present context we first prove that any optimal transport plan is concen- 
trated on a set of pairs of points that are connected by a unique minimal curve and 
that these transport rays cannot bifurcate (see Section HJ. Next, following ideas 
already introduced in the literature and more specifically here inspired by |22] , one 
introduces variational approximations (see Section [SJ . This procedure allows to 
select optimal transport plans with specific properties. These transport plans will 
eventually be proved to be induced by /x-measurable maps. This procedure is here 
essentially twofold. On one hand it allows to select optimal transport plans that 
are solution to a secondary variational problem. This secondary variational prob- 
lem prescribes the geometry of transport rays. The selected transport plans are 
indeed shown to be monotonic along transport rays (see Lemma I4.3[) . On the other 
hand, given a transport plan, one can in our context interpolate between its first 
and second marginal in a natural way (see Subsection I3.3[) . Absolute continuity 
and more importantly L°°-estimates on the density of the interpolations will play 
an important role and one can indeed prove L°°-estimates on the interpolations in 
the approximating variational problems (see Proposition 15 . 71) . These estimates rely 



lr The preprint [7] which appeared during the completion of the present paper also addresses 
Monge's transport problem in metric spaces with a geodesic distance using a Sudakov-type dimen- 
sion reduction argument and disintegration of measures. 
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on the so-called Measure Contraction Property of HP. In the limit one will even- 
tually get suitable L°°-estimates on the interpolations constructed from optimal 
transport plans selected through the variational approximation procedure. Next we 
note that some properties of plans with absolutely continuous first marginal proved 
in [12] can be easily generalized to our setting (see Section |6|). These properties 
are independent of the transport problem. They rely on the notion of Lebesgue 
points of functions and Lebesgue points of sets, notions which make sense for in- 
stance in any doubling metric measure space. Together with the above mentioned 
L°°-estimates on the interpolations, one can in particular prove density estimate 
on the transport set of selected optimal transport plans in the same way as in [T^] 
(see Section [7]). All together, namely combining this later density estimate on the 
transport set (Lemma 17. ip with Lemma 16.21 and remembering the monotonicity 
along transport rays (Lemma l4.3p it turns out that the selected transport plans are 
necessarily induced by a transport map as eventually proved in Theorem 18. 11 

The paper is organized as follows. In Section [2] we recall classical facts about 
optimal transportation for later use. In Section [3] we describe the Heisenberg group 
focusing on the features that will be needed in this paper. In Section |4] we prove 
geometric properties of optimal transport plans and the monotonicity along trans- 
port rays of solutions to the secondary variational problem. The variational ap- 
proximations are introduced and studied in Section [5] In Section [6] we state in 
our framework properties of plans with absolutely continuous first marginal proved 
in [12] in R™ . This section, independent of the transport problem, contains density 
results that play an essential role in the strategy followed here. In Section[7]we prove 
lower bounds on the density, in some suitable sense, of the transport set of optimal 
transport plans selected through the variational approximations. We conclude in 
Section [8] proving that the selected transport plans are induced by a transport map. 
We discuss in the final Section [9] some possible extensions of this approach to other 
spaces. 



We recall some well-known facts about optimal transportation confining ourselves 
to statements that will fit our needs in the rest of the paper. More general versions of 
these results hold in more general contexts. We refer to e.g. [25] and the references 
therein. 

Let (X,d) be a Polish space, i.e., a complete and separable metric space. We 
denote by V(X) the set of all Borel probability measures on X and by V C (X) the 
set of Borel probability measures on X with compact support. The weak topology 
we consider on V(X) is the topology induced by convergence against bounded and 
continuous test functions (or narrow topology). 

2.1. Kantorovich transport problem. Let fi, v € V(X). We denote by 



the set of all transport plans between /i and v. Here 7Ti, ■k-i : X x X — > X denote 
the canonical projections on the first and second factor respectively. 

Given c : X x X — > [0, +oo] a lower semicontinuous cost function, we look at 
Kantorovich transport problem between fj, and v with cost c: 



As a classical fact, existence of solutions to (|2.1I) follows from the weak compactness 
of II(/i, u) together with the lower semicontinuity of the functional to be minimized. 
We call them optimal transport plans. 



2. Preliminaries on optimal transportation 



LT(/z, v) := { 7 e P(X x A); (tti)^ = /m, {n 2 ) a = v} 



(2.1) 




4 



L. DE PASCALE AND S. RIGOT 



Cyclical monotonicity. We say that a set r C X x X is c-cyclically monotone if 

N N 

^2 c(xi,yi) < ^2 c(x i+1 , yi) 

i=l i=l 

whenever N > 2 and (xi,yi), . . . , (xn, tjn) G T. 

Theorem 2.1. Let 7 € n(/x, v) be an optimal transport plan and assume that 
J XxY c(x,y) dj < +00. Then 7 is concentrated on a c-cyclically monotone Borel 
set. 

Dual formulation - Kantorovich potentials. Let ip : X — >■ RU {—00}. We say 
that ip is c-concave if ip ^ —00 and if there exists ip : X — > R U {—00}, ^3 ^ —00, 
such that 

■0(a;) = inf c(a;,y) - ip(y). 
yex 

Theorem 2.2. In addition to the previous assumptions, assume thatc is real-valued 
and that 

V(x,y)eXxX, c(x,y)<a(x) + b(y) 
for some a G L 1 ^) and b G L (y). Then one has 



(2.2) min / c(x, y) d~f(x, y) — max / ip(x) d/j,(x) + / ip c (y) dv(y) 

ien(n,u) J XyX Jx Jx 

where the above maximum is taken among all c-concave functions ip and ip c {y) '■= 
inf xe x c(x, y) - ip(x). 

Definition 2.3 (Kantorovich potentials). We say that ip : X — > R U { — 00} is a 
Kantorovich potential if ip is a c-concave maximizer for the right-hand side of 



Theorem 2.4. With the same assumptions as in Theorem \2.2\ let ip be a Kan- 
torovich potential. Then 7 G n(/i, v) is an optimal transport plan if and only if 

c{x, y) = ip{x) + ip c (y) 7 — a.e. in X x X. 

We will use these results for various cost functions. In the particular case which 
is the core of this paper and where c(x,y) = d(x,y) and /z, v G V C (X), one can 
rephrase these results in terms of 1-Lipschitz Kantorovich potentials. More precisely, 
set 

Lip x (d) := {it : X ->• R; \u(x) - u(y)\ < d(x, y) Wx,yeX}. 

Theorem 2.5. Let /i, v G V C (X). Then one can find a Kantorovich potential 
11 G Lip^d) so that 



d(x,y)dj{x,y) = / u(x)dfi(x)- / u{y)dv{y) 
7en(/i,y) J xxx Jx Jx 

and 7 G v) is an optimal transport plan solution to Kantorovich transport 

problem (|2.1[) between fi and v with cost c(x,y) = d(x,y) if and only if 

u(x) — u(y) — d(x, y) 7 — a.e. in X x X. 

2.2. Transport problem. Let /j,, v G V(X). We say that a /i-measurable map 
T : X — > X is a transport map between \i and v if Tjj/z = 17 i.e., v{B) = /i(T _1 (B)) 
for all Borel set B. 

Given c : X x X — > [0, +00 [ a continuous cost function, we look at the transport 
problem between /1 and v with cost c: 



(2.3) min / c(x,T(x)) dp,(x). 

We say that a transport plan 7 G n(/i, ^) is induced by a transport if there exists a 
yu-measurable map T : X — >• X such that (/®T)j/i = 7 where {I®T){x) := (x, T(x)). 
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Such a map is automatically a transport map between \i and v. We also recall that 
if a transport plan 7 is concentrated on a 7-measurable graph then 7 is induced by 
a transport. 

Theorem 2.6 (Optimal transport plans versus optimal transport maps). 

(i) Assume that 7 is an optimal transport plan solution to Kantorovich transport 
problem (|2.1|) and that 7 is induced by transport T. Then T is an optimal transport 
map solution to the transport problem (|2.3|) . 

(ii) Assume that any optimal transport plan solution to Kantorovich transport 
problem (|2.1|) is induced by transport. Then there exists a unique optimal transport 
map solution to the transport problem (|2.3[) . 

3. Preliminaries on EP 

We consider the Heisenberg group H™ equipped with its Carnot-Caratheodory 
distance. Endowed with this distance H" is a polish geodesic and non-branching 
metric space and a doubling metric measure space when equipped with its Haar 
measure. 

3.1. The Heisenberg group. The Heisenberg group H" is a connected, simply 
connected Lie group with stratified Lie algebra. We identify it with C" x M equipped 
with the group law 

n 

[C, t] ■ [C, t'] := [C + C', t + t' + 2 TnCjtj] 

3=1 

where ( = (Ci> • • • , Cn), C = (Ci, • • • , O £ C" and t, t' e M. The unit element is 
and the center of the group is 

L:={[0,t] iel}. 

There is a natural family of dilations S r on H" defined by 5 r ([(,t}) :— [r(,r 2 i\. 
These dilations are group homomorphisms. 

We may also identify H™ with K 2n+1 via the correspondence [£, t] — (£, r], t) 
where £ = . . . ,£„), V = ( m , . . .,r?„) € R n , feR and ( = (Ci, . . . ,C«) € C™ with 
= + ■ The horizontal subbundle of the tangent bundle is defined by 

H := span{X, ; j = 1, . . . ,n} © span {Y, ; j = 1, . . . , n} 

where the left invariant vector fields Xj and Yj are given by 

A „ := % + -In.ih , Yj : ^ - 2^9 t . 

Vector fields in H will be called horizontal vector fields. Setting T := dt, the only 
non trivial bracket relations are [Xj,Yj] — —AT hence span{T} = [H,TL] and the 
Lie algebra H n of H™ admits the stratification H n =%@ span{T}. 

The Lebesgue measure C 2n+1 on H" w R 2n+1 is a Haar measure of the group. It 
is (2n + 2)-homogeneous with respect to the dilations, 

C 2n+1 (5 r {A)) = r 2B+2 £ 2n+1 (A) 

for all Borel set A and all r > 0. 

3.2. Carnot-Caratheodory distance. The Carnot-Caratheodory distance on H™ 
is defined by 

(3.1) d(x, y) — mi{length go ('-f); 7 horizontal C^-smooth curve joining x to y}, 

where a C 1 -smooth curve is said to be horizontal if, at every point, its tangent vector 
belongs to the horizontal subbundle of the tangent bundle and go is the left invariant 
Riemannian metric which makes (X\, . . . , X ni Y%, . . . , Y n ,T) an orthonormal basis. 
For a general presentation of Carnot-Caratheodory spaces, see e.g. [8], [19]. 
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The topology induced by this distance is the original (Euclidean) topology on 
H™ w (R 2n+1 ,(/ ) and (W n ,d) is a complete metric space. The distance is left 
invariant and 1-homogeneous with respect to the dilations, 

d(x ■ y,x ■ z) = d(y, z) and d(S r (y),S r (z)) — r d(y, z) 

for all x,y, z E H™ and all r > 0. It follows in particular that B(x, r) = x-S r (B(Q, 1)) 
and hence 

(3.2) C 2n+1 (B(x,r)) =c n r 2n+2 

for all x £ EP, all r > and where c„ := £ 2n+1 (.B(0, 1)) > 0. The measure C 2n+1 
is in particular a doubling measure on (M n ,d). For more details about doubling 
metric measure spaces, see e.g. [17] . 

Endowed with its Carnot-Caratheodory distance EP is a geodesic space, i.e., for 
all x, y E H™, there exists a curve a E C([a, 6],H") such that cr(a) = x, <r(6) = y 
and d(x,y) = 1(a) where 

2V-1 

i(cr) = sup sup V] d(o-(ti),cr(ij +1 )). 

Up to a reparameterization one can always assume that length minimizing curves 
a are parameterized proportionally to arc-length, i.e., 

d(a(s), a(s )) = u (s — s) 

for all s < s' s [a, 6], where w := d(a(a),a(b))/(b — a) is the (constant) speed 
of the curve. As a convention we will use throughout this paper the terminology 
minimal curves to denote length minimizing curves parameterized proportionally 
to arc-length. 

Definition 3.1 (Minimal curves). We say that a continuous curve a : [a,b] — > H" 
is a minimal curve if 1(a) — d(a(a),a(b)) and a is parameterized proportionally to 
arc-length. 

In general Carnot-Caratheodory spaces, issues about uniqueness and regularity 
of minimal curves between any two points as well as issues about the regularity of 
the distance function to a given point could be delicate. In the specific case of the 
Heisenberg group, equations of all minimal curves can be explicitly computed and 
exploited to overcome these difficulties. We recall below the description of minimal 
curves in H n , see e.g. [16], [5]. We set 

(3.3) n := {(x, y) E H™ x H"; aT 1 -y^L}. 

Theorem 3.2 (Minimal curves in H"). Minimal curves in H™ are horizontal C 1 - 
smooth curves such that the infimum in (|3.1j) is achieved. One has the more precise 
description: 

(i) Non trivial minimal curves starting from and parameterized on [0, 1] are 
all curves a XtV for some \ £ C" \ {0} and <p E [— 2-7T, 27r] where 

i x r , (e-^ s - l)x , ,2 <ps-sm(tps) 

°x,A s ) = b M — 2 ] 

if (p E [-27T, 27r]\{0} and 

ifip — 0. Moreover one has \\\ — d(0,a X:V (l)). 

(ii) For all (x, y) E £1, there is a unique minimal curve x ■ a x ^ between x and y 
for some x £ C" \ {0} and some Lp E (— 2-7T, 27r) and one has \x\ — d(x, y). 

(iii) // (x,y) ^ fl, x^ 1 ■ y — [0,t] for some t E M* , there are infinitely many 
minimal curves between x and y. These curves are all curves of the form 
x ■ <7 Xj 27r ift>0, x ■ er Xi _27r if t < 0, for all x E C™ such that \x\ — \/n\t\ ■ 
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Here and in the following, |x| = (E™=i \Xj?) 1/2 for X = (Xi, • • • ,Xn) 6 C". In 
particular it follows from this description that (H n ,d) is non-branching. 

Proposition 3.3 (Non-branching property of HP). The space (H n ,d) is non- 
branching, i.e., any two minimal curves which coincide on a non trivial interval 
coincide on the whole intersection of their intervals of definition. 

Equivalently for any quadruple of points z, x, y, y' 6 HP, if z is a midpoint of x 
and y as well as a midpoint of x and y' , then y = y'. 

The next lemma collects some differentiability properties of the distance function 
to a given point to be used later. For y <E HP, we set L y :=y ■ L. 

Lemma 3.4. Let y € HP and set d y (x) := d{x,y). Then the function d y is of class 
C°° on M n \L y (equipped with the usual differential structure when identifying HP 
with R 2n+1 ). Moreover one has 

(i) \V H d y {x)\ = 1 for all x € W l \L y where 

S7Hd y (x) := (Xidy(x) + iYid y (x), . . . , X n d y (x) + iY n d y (x)). 

(ii) // Vd y (x) = Vdyi(x) and d(x,y) = d{x,y') for some x 6 W l \{L y U L y >), 
then y = y 1 . Here V = (d^ , • • • , d^ n , d Vl , • • • , d Vn , dt) denotes the classical 
gradient when identifying HP with M 2 ™ +1 . 

Proof. Set $(x, <p) := °~x-v(^) wnere a x,f IS gi yen m Theorem 13.21 This map is a 
C°°-diffeomorphism from C" \ {0} x (-2tt, 2tt) onto H" \ L (see e.g. [20], [5], [18]). 
If x = $(x, <p) £ H" \ L with (x, ip) eC n \ {0} x (-27T, 2tt), one has d (x) = \x\ and 

V H d (x) = -j — j-e lv and d t do(x) = —— , 

see [5] Lemma 3.11]. Next, by left invariance, we have d y (x) = do(y~ 1 ■ x), 
V ' H<ly{x) = V^d (y~ 1 • x ) an d d t d y (x) — d t d (y^ 1 ■ x) if x g HP\L y and the 
lemma follows easily. □ 

3.3. Interpolation between measures. The notion of interpolation constructed 
from a transport plan between any two measures will be one of the key notion to 
be used later. To define it in our geometrical context, we first fix a measurable 
selection of minimal curves, i.e., a Borel map S : HP x H" — > C([0,1],H") such 
that for all x, y € H™, S(x, y) is a minimal curve joining x and y. The existence of 
such a measurable recipe to join any two points in HP by a minimal curve follows 
from general theorems about measurable selections, see e.g. [25J Chapter 7]. Next 
we set e t (cr) := a(t) for all a e C([0, 1],EP) and t E [0,1]. In particular e t (S(x,y)) 
denotes the point lying at distance t d{x, y) from x on the selected minimal curve 
S(x, y) between x and y. 

Definition 3.5. Let [i, v G T'(HP) and let 7 S IT(/^, v). The interpolations between 
/i and v constructed from 7 are defined as the family ((et o S')jj7)) te ro j i] of Borel 
probability measures on HP. 

Note that these interpolations depend a priori on the measurable selection S 
of minimal curves. This is actually not a serious issue for our purposes. We will 
moreover always consider interpolations constructed from transport plans that are 
concentrated on the set D, on which S(x, y) is nothing but the unique minimal curve 
between x and y. Note also for further reference that S\_n is continuous. 

3.4. Intrinsic differentiability. Intrinsic differentiability properties of real- valued 
Lipschitz functions on HP, namely a Rademacher's type theorem, will be useful 
when considering 1-Lipschitz Kantorovich potentials. This theorem is a particular 
case of a more general result due to P. Pansu. We say that a group homomorphism 
g : HP — » K is homogeneous if g(S r (x)) — r g{x) for all x £ HP and all r > 0. 
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Definition 3.6. We say that a map / : H" — > M is Pansu-differentiable at x G H ra 
if there exists an homogeneous group homomorphism g : H" — > K such that 

lim fM-fW-S^-v) = o. 
y-Hc d(y, x) 

The map g is then unique and will be denoted by Dnf(x). 

If / : H" — > K is Pansu-differentiable at x G H™ then the maps s h-> /(a; • 
<5 s [ej,0]), resp. s i— > /(a; • S s [e n + j , 0] ) , are differentiable at s = and if we denote 
the corresponding derivatives by Xjf(x), resp. Yjf(x), then 

TO 

D H f(x)(Z, V ,t) = ^ j X j f(x) + Vj Y j f(x). 

3=1 

Here e 3 = (#[,..., G C" and e n+J = . . . , i<5£) £ C". Using similar 
notations as in the classical smooth case, we then set V fif( x ) '■= {X\f(x) + 
iY 1 f(x),...,X n f(x) +iY n f(x)). 

Theorem 3.7 (Pansu-differentiability theorem). [21] Let f : (H n ,d) ->• K be a 
C-Lipschitz function. Then, for C 2n+1 -a.e. x G H", the function f is Pansu- 
differentiable at x and |Vi//(a;)| < C. 

The next lemma will be used to prove that any optimal transport plan is con- 
centrated on the set O. 

Lemma 3.8. Let u G Lip^d), x G H™ be such that u is Pansu-differentiable at x 
with |V ' hu{x)\ < 1 and let y G H n be such that u(x) — u{y) = d{x,y). Then there 
exists a unique minimal curve between x and y. 

Proof. Let a : [0, 1] — > H™ be a minimal curve between x and y. Then a is a 
horizontal C 1 -smooth curve and if a(t) = ((Ji(t), . . . , <72n+i(0) G ^™ ~ R 2n+1 , one 
has for all t G [0, 1], 

TO 

3=1 

and |<7h(*)| =d(x,y) where <th(*) := (oi(*) + £<r n+ i(t), . . . , & n (t) + i &2n(t)) G C". 
On the other hand, one has 

u(x) — u(a(t)) — d(x, cr(t)) = t d{x, y) 

for all t G [0, 1]. Differentiating this equality with respect to t, we get 

Oj-(0) Xju(x) + & n +j{0) Y jU (x) = — u(a{t)) u=0 = -d(x, y). 

3=1 

All together, it follows that 

n 

d(x,y) = \ J2<7j(0)Xju{x)+a n+j (0)Y jU (x)\ < \\7 H u{x)\ \& H (0)\ < d(x,y). 

3 = 1 

In particular, there is equality in all the previous inequalities which implies in turn 
that &h(0) = — d(x,y)V hu(x). On the other hand one knows from Theorem 13.21 
that a = x ■ a x ^ for some x G C™ \ {0} and ip G [— 2tt, 2tt]. In particular one has 
o~h(Q) = X- It follows that x = — d(x, y) V hu(x) is uniquely determined hence there 
is a unique minimal curve joining x and y according once again to the description 
given in Theorem 13.21 □ 
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4. Properties of U.i(p,i/) and U 2 (h, v) 

Let fi, v £ ^(l") be fixed. We denote by H\(n, v) the set of optimal transport 
plans solution to Kantorovich transport problem (|2.1I) between fi and v with cost 
c{x,y) = d(x,y). 

We first prove some geometric properties of optimal transport plans. These 
properties follow from the behavior of minimal curves in (BP, d). In the next lemma, 
we prove that any optimal transport plan is concentrated on the set fi (see (\'3M ) 
of pair of points that are connected by a unique minimal curve. 

Lemma 4.1. Let 7 elli(^,^) and assume that /1 <C £2n+i Then for a. e. (x,y), 
there exists a unique minimal curve between x and y. 

Proof. Let u £ Lip x (d) be a Kantorovich potential associated to Kantorovich trans- 
port problem (|2.1[) between /1 and v with cost c(x,y) — d(x,y) (see Section [2] and 
Theorem 12.51 there) . Since u £ Lip 1 (d), we know from Theorem 13 . 71 that for £ 2 ™+ 1 _ 
a.e., and hence /x-a.e., x £ HP, u is Pansu-differentiable at x with \V hu(x)\ < 1. 
Then the conclusion follows from Lemma 13.81 since u(x) — u(y) = d(x,y) for 7-a.e. 
(x,y) (see Theorem l2.5|) . □ 

The next lemma says that minimal curves used by an optimal transport plan 
cannot bifurcate. It follows essentially from the non-branching property of (HP , d) . 

Lemma 4.2. Let 7 £ IIi(/i, v). Then 7 is concentrated on a set T such that the 
following holds. For all (x, y) £T and {x' , y') £ V such that x ^ y and x ^ x' , if x' 
lies on a minimal curve between x and y then all points x, x 1 , y and y' lie on the 
same minimal curve. More precisely, there exists a minimal curve a : [a, b] —> HP 
such that x = o~(a), y — a(t) for some t £ (a, b], x = cr(s) for some s £ (a,t] and 
y' = cr(t') for some t' £ [s, b]. 

Proof. Let (x, y) £ IP x H" and (x', y') £ H™ x H™ such that x + y and x' ^ x. 
Assume that x' £ cr((0, d(x, y)]) where a : [0,d(x,y)] —> H™ is a unit-speed minimal 
curve between x and y. Let a' be a unit-speed minimal curve between x' and y' 
parameterized on [d(x, x'),d(x, x') + d(x', y')]. Assume moreover that 

d(x, y) + d(x', y') < d{x, y') + d{x',y). 

Recall that this holds true for 7-a.e. (x,y) and (x',y r ) by Theorem 12.11 Then the 
curve a : [0, d(x, x') + d(x',y')] — > H™ which coincides with a on [0, d(x, x')} and 
a' on [d(x, x 1 ), d(x, x') + d(x',y')] is a length minimizing curve between x and y' . 
Indeed, otherwise we would have 

d{x, y') < 1(a) = l{a l[0id{Xt3 . n] ) + l{a' [d{x x , ) d(x <x , )+d{x , <yl)] ) = d(x, x') + d(x', y'). 

Since x' lies on a minimal curve between x and y, we have d(x, x')+d(x' , y) = d(x, y) 
and we get 

d(x, y 1 ) + d(x', y) < d(x, y) + d(x', y') 

which gives a contradiction. It follows that a and a are unit-speed minimal curves 
that coincide on the non trivial interval [0, d(x, x')\. Since HP is non-branching (see 
Proposition 13 . 3[) . this implies that a and a are sub-arcs of the same minimal curve, 
namely a if d(x,y') < d(x,y) and a otherwise, on which all points x, x', y and y' 
lie. And the conclusion follows. □ 

We denote by ^(t 1 , v) the set of transport plans solution to the secondary vari- 
ational problem: 

min / d(x,y) 2 dy(x,y). 
7eriiUi,i/) Ji»xi» 

Optimal transport plans selected through the variational approximations to be 
introduced in Section [5] will be solution to this secondary variational problem. The 
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next lemma gives a one-dimensional monotonicity condition along minimal curves 
used by optimal transport plans in Il2(/i, v). This follows essentially from a con- 
strained version of cP-cyclical monotonicity. 

Lemma 4.3. Let 7 £ n 2 (/i, v). Then 7 is concentrated on a set T such that the 
following holds. For all {x, y) £T and (x', y') £ T such that x ^ y and x ^ x' , if x' 
lies on a minimal curve between x and y then all points x, x' , y and y' lie on the 
same minimal curve ordered in that way. 

In other words, there exists a minimal curve a : [a, b] —> H n such that o~(a) = x, 
<j{t) = y for some t £ (a, b], a(s) — x' for some s £ (a,t] and o-(t') = y' for some 
t' £ [t,b]. 

Proof. First, as a classical fact, one can rephrase the secondary variational prob- 
lem as a classical Kantorovich transport problem (|2.ip between fj, and v with cost 
c(x,y) = f3(x,y) with 



P{x,y) = 




if u(x) - u{y) = d(x,y), 
otherwise, 



where u £ Lip 1 (d) is a Kantorovich potential associated to Kantorovich transport 
problem (|2.1[) between /j, and v with cost c(x,y) = d(x,y) (see Section [5] and Theo- 
rem [2T5] there). Since /3 is lower semicontinuous and J H „ xH „ f3(x, y) d'y(x, y) < +00 
for all 7 £ Il2(/z, v), it follows from Theorem 12.11 that any 7 £ ^(Hjv) is con- 
centrated on a /3-cyclically monotone set. So, taking into account the fact that 
Il2(//, v) C IT v), we know that 7 £ Il2(/i, v) is concentrated on a set T such 
that 

u{x) - u(y) = d(x,y) 

for all (x, y) £ T, 

f3(x,y) + 0(x',y , )<l3(x,y') + (3(x',y) 

for all (x, y) £ T and (x', y') £ T and the conclusion of Lemma H . 2 1 holds . 

Then let (x,y) £ T and (x',y') £ T be as in the statement. By Lemma [4.21 
the conclusion will follow if we show that d(x',y) < d(x',y'). First we check that 
P(x', y) — d(x\ y) 2 and /3(x, y') = d(x, y') 2 . We have 

u{x) < u(x') + d(x, x') < u(y) + d(x' , y) + d(x, x') — d(x, y) + u(y) = u(x) 

hence all these inequalities are equalities. In particular, we get that u(x') = u{y) + 
d(x', y) hence f3(x', y) = d(x' , y) 2 . We also get that 

u(x) — d(x, x') + u(x') — d(x, x') + u{y') + d(x' , y') — u(y') + d(x, y') 

hence f3(x, y') = d(x, y') 2 . If d(x' , y 1 ) < d(x' , y), we get 

p(x,y')+/3(x',y) - p(x',y') - p(x,y) 

= d(x, y') 2 + d(x', y) 2 ~ d(x', y') 2 - d(x, y) 2 

= (d(x, x') + d{x', y')f + d(x', y) 2 - d(x', y') 2 - (d(x, x') + d(x' , y)) 2 
= 2 d(x, x')(d(x',y') - d(x', y)) < 
which gives a contradiction. □ 



5. Variational approximations 

We introduce variational approximations in the spirit of j4] (see also [10], [3]) 
by rephrasing in our geometrical context the variational approximations consid- 
ered recently in |22j . This approximation procedure will be used to select optimal 
transport plans that will be eventually proved to be induced by transport maps. 
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Let fi, v G V c (M n ) be fixed. Let if be a compact subset of H™ such that 
spt /it U spt u C K and set 

LT := {7 G V(W l x H n ); (7^7 = fx, spt (^ 2 ) s7 C K}. 

For e > fixed and 7 G II, we set 

C £ (j) := - Wi((w 2 )tJ,u) + / d(x,y)dy(x,y) 

£ Ji"xl" 

+ e d(x, yf d~f(x, y) + e 6n+s card (spt (n 2 )a) 

JH n xB n 

and consider the family of minimization problems: 
(P s ) min{C £ (7); 7 G n}. 

Here Wi denotes the 1-Wasserstein distance defined for any two probability mea- 
sures m, fj.2 G V(M n ) by 

Wi(Lii,fj, 2 ) := miii / d(x,y)drf(x,y). 
7en(/n,/i 2 ) Jh>« x h" 

First we note that ( |P £ [ ) always admits solutions. 

Theorem 5.1. For any e > 0, £/ie problem Q_P e | ) admits at least one solution and 
min{C 6 (7); 7 G 11} < +00. 

Proof. First note that since K is compact, C e (7) < +00 for any 7 G II such that 
(^2) j 7 is finitely atomic. Next the existence of solutions to ( |P £ [ ) follows from the 
weak compactness of II, the lower semicontinuity of the three first terms to be 
minimized and the Kuratowski convergence of the supports of weakly converging 
probability measures (see (2J Chapter 5]). □ 

Next, weak limits of solutions to fl-P e D are optimal transport plans that are solu- 
tions to the secondary variational problem introduced in Section [4] to which we refer 
for the definition of Il2(/x, v). Modulo minor modifications due to our geometrical 
context, this can be proved with the same arguments as those given in [22] . 

Lemma 5.2. Let Sk be a sequence converging to and r y £k a sequence of solutions 
to (P ek ) which is weakly converging to some 7 G V(M n x H"). Then 7 G II 2 (/x, v). 

Proof. First we note that for any m > 1, one can find a finite set F m C K such that 
card F m < C m 2n+2 for some constant C > which depends only on n and diam K 
and a Borel map p m : K — > F m such that 

d(p m (x),x) < 1/m 

for all x G K. Indeed choose x\ G K. For i > 2, choose by induction x\ G 
K\L)j < iB(xj, 1/m) as long as K\{Jj <i B{xj, 1/m) ^ 0. Let F m denote the set of all 
these points. The balls B(xi, l/(2m)) are mutually disjoint and UiB(xi, (l/(2m)) C 
B(xi, diam K + 1). Remembering f|3 . 2[) . it follows that 

c„(2m)- 2 "- 2 card J F 1 < C 2n+1 (U l B(x l ,l/(2m))) 

< C 2n+1 (B( Xl ,diamK + 1)) = c„(diam^ + l) 2n+2 

for any finite subset F C F m , hence F m is a finite set with cardi 7 ^ < C ' m 2n+2 
where C depends only on n and diam K. Next, by construction, for any x G K, 
there exists a unique Xi G F m such that x G B(xi, 1/m) \ \Jj <i B{xj, 1/m) and we 
then set p m {x) :— Xi. 

The proof of the lemma can now be completed following the same arguments as 
those in [22]. For sake of completeness, we sketch these arguments below. Let 7 £fc 
and 7 be as in the statement. For m > 1, one sets v m := (pm)$v. Note that by 
construction of F m and p m , one has card (spt v m ) < C m 2n+2 and W\{v m , v) < 1/m. 
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To check that 7 G Il(p,v), one takes some j m G II(^, v m ) C II and uses the 
optimality of j Sk which implies 

Wi((7r 2 )|t7£ fe ,^) < £kC ek (j m ) 

< i- + s k diam(K) + e\ diam(X) 2 + C e e k n+9 m 2n+2 . 

Then one lets — > with to > 1 fixed and then m -4- +00 to get that (7r2)j7 efc 
converges weakly to ^. Since it also converges weakly to (772)07, it follows that 

7 e IL(fi,u). 

To check that 7 G IIi(^, ^), one takes 7 G IIi(^, v) and sets 7 m := (Id,p m )$-f G 
n(/x,z^ m ) where (Id,p m )(x,y) = (x,p m (y)). By optimality of -f £k , one has 

/ rf(a;,y)rf7 £fc (a;,y) < C ek ^f m ) 
Jn n xm n 

<—+ f d{x, y) dj m (x, y) + e k diam(X) 2 + C e 6 k n+8 m 2n+2 . 
Choosing m of the order of s k 2 and letting Sk — > 0, one gets 

/ d(x, y) d-y(x, y) < I d(x,y)drj(x,y) 

Jh^xH" il"xl" 

hence 7 G IIi(/i, v). 

Finally, to check that 7 G n 2 (/i, v), one uses once again the optimality of j £k in 
the following way, 

^((^fy^) +£fc / d(x,y)d'y ek (x,y)+el d(x,y) 2 dj Ek (x,y) 

JH"xI" JI"xI" 

< £fcCe fc (7 m ) 

On the other hand, one has 

< / d(x,y)d-f ek (x,y) + W 1 {{n 2 )ae k ,v) 

and 

/ d{x,y)drf m (x,y) = d(x,p m (y)) <Py(x,y) 

JH"xH" Jl"xl" 

< / d(x,y)drf(x,y) + d(y,p m (y))drf(x,y) 

JI"Xl™ Jl"Xl" 

< W 1 {p,v) + —. 

to 

It follows that 

/ d{x,y) 2 dj Ek (x 7 y) 

JM n xl" 

d(x,y) 2 drf m (x,y) + Cei n+7 m 2n+2 



1 

< — 



2 n r / 

me fe mefc Jh" xH" 



provided £u < 1. Choosing to of the order of e k 3 and letting -» 0, one gets 

/ d(x,y) 2 d-y{x,y) < / d(x,y) 2 drf(x,y). 
Since 7 G IIi(/x, ^) was arbitrary, it follows that 7 G n 2 (^, ^). □ 
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The rest of this section is devoted to the study of the solutions to (P e ). We fix 
e > and set 

c £ (x, y) := d(x, y) + e d(x, y) 2 . 
We first recall the following classical fact. 

Lemma 5.3. Let 7 £ be a solution to (P £ ). Then for any Borel set U C H n x 
Et"; ( 7r 2)|)(7e L^O is finitely atomic and r y £ \U is a solution to Kantorovich transport 
problem (|2.ip between (7Ti)jt(7 £ \U) and (7T2)j(7 e L^O with cost c e . 

Proof. The fact that (^2)0 (7^ \U) is finitely atomic obviously follows from the fact 
that C £ (j e ) = min{C e (7); 7 G 11} < +00. Next it is also immediate that j e is a so- 
lution to Kantorovich transport problem (|2.ip between [i and {^2)t,{le) with cost c e . 
Then as a classical fact, the claim follows from the linearity of the functional to be 
minimized with respect to the transport plan. If 7 e n((7r 1 )j(7 £ \U), (7r 2 )jj(7 e [U)), 
one indeed simply compare C e (7 e ) with C e (7) where 7 = 7 £ [(EI™ x H") \ U + 7 € 
n(/x, (7r 2 )o(7 e )) to get the conclusion. □ 

Next in this section we consider interpolations between two measures /Z, V G 
■p c (H™) that are constructed from a transport plan solution to Kantorovich trans- 
port problem (|2.1I) between these two measures with cost c e . We prove absolute 
continuity and, more importantly, L°°-estimates on the density with respect to 
£2n+i Q f thggg interpolations whenever ~p <C £} n + 1 anc ] 77 j s finitely atomic, see 
Proposition 15.71 We divide the arguments into several steps. First we prove that 
any solution to this Kantorovich transport problem is induced by a transport. 

Theorem 5.4. Let ~p, v € 7 ? c (BI n ) be fixed. Assume that ~p <C C 2n+1 and that v is 
finitely atomic. Then any solution to Kantorovich transport problem (|2.1I) between 
~p and v with cost c £ is induced by a transport. In particular there exists a unique 
optimal transport map solution to the transport problem (12.31) between ~p and V with 
cost c e . 

Proof. Let ijj be a Kantorovich potential for Kantorovich transport problem (|2.ip 
between ~p and V with cost c e given by Theorem 12.21 Let denote the atoms 

of v. We prove that for £ 2n+1 -a.e. x e H", there is at most one point yi for some 
i e {1, . . . , k} such that 

ip(x) +ip c (yt) = c e (x,yi). 
Since ~p < C 2n+1 , it will follow that any transport plan solution to Kantorovich 
transport problem (|2.1[) between ~p and v with cost c e is concentrated on a JL- 
measurable graph and hence induced by a transport. This implies in turn existence 
and uniqueness of the optimal transport map solution to the transport problem 
(|2.3p between "p and V with cost c e (see Theorem l2.6p . 

For i ^ j, set h v] (x) := c e (x,yi) - c s (x,yj) + ip c (yj) - ip c (yi). It follows from 
Lemma OH] that hij is of class C°° on the open set H n \ (L yi U L y .) with V/ijj 7^ 0. 
Indeed assume on the contrary that Whij(x) = for some x € H" \ (L yi U L yj ). 
Then, differentiating along the horizontal vector fields Xj and Yj , we would have 

V H d Vl (x) (1 + 2e d w (a:)) = V H d yj (x) (1 + 2e d Vj (x)). 

Since \V Hd yi (x)\ = \X/ nd Vj (x)\ (see Lemma l3.4f i)). this would imply that d Vi {x) — 
d Vj (x) and in turn that Vndy^x) — V Hd yj (x). Since we also have by assumption 
d t d Vi {x) = d t d yj (x), Lemma [BTW ii) would give yi = yj. It follows that the set 
{x G H™ \ (L Vi ULyA; hij(x) = 0} is a C°°-smooth submanifold of dimension 2n in 
R 2rl+1 and hence has Lebesgue measure 0. Since C 2n+1 (L yi ) — 0, it follows that 

£2n+l(jJ{ a; g H n. h ,.^ = }) = 

and the claim follows. □ 
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If T : H" -> HP, we set T t = e t o S o (I ® T), i.e., T t (a;) is the point lying at 
distance t d(x,T(x)) from a; on the selected minimal curve S(x,T(x)) between x 
and T(x) (see Subsection 13.31 for the definition of S and e*). 

Proposition 5.5. [25j Chapter 7] Let ~p, V G P C (HP) be fixed such that ~fl <C 
£2n+i ^ j s fi n n e iy atomic. Let T e be the optimal transport map solution to 
the transport problem (|2.3[) between ~p and V with cost c e . Then there exists a ~p 

- measurable set A such that ~p(A) = 1 and such that for each t G [0, 1), T^\_a is 
injective. 

The cost c e can be recovered as coming from a so-called coercive Lagrangian 
action. Since (HP,cf) is non-branching, the proposition essentially follows from [25J 
Chapter 7, Theorem 7.30]. However one does not need the full strength of the 
theory developed in [35J Chapter 7] to get the conclusion of Proposition (|5.5[) and 
we sketch below the arguments for the reader's convenience. 

Proof. Let < s < t < 1 and x, y G ET™. Set 

c°/(x,y) = d(x,y)+e^£. 

t — s 

The space (HP, d) being a geodesic space and !ih>m + £ii 2 being strictly increasing 
and strictly convex on [0, +oo), one has 

c e (x,y) < c°/(x,z) + c t £ 1 (z,y) 

for all x, y, z G HP and t £ (0,1), with equality if and only if any curve in 
C([0, 1],H") obtained by concatenation of a minimal curve a XtZ : [0,t] — > HP be- 
tween x and z and a minimal curve a z ^ y : [t, 1] — > HP between z and y is a minimal 
curve between x and y. 

On the other hand, by encyclical monotonicity, one knows that there exists a ~p 

- measurable set A such that ~p(A) = 1 and 

c £ (x, T e (x)) + c £ (x, T E (x)) < c e (x, T £ {Z)) + c £ (x, T e {x)) 

for all x, x G A (see Theorem 12. II) . 

Now let t £ (0, 1) be fixed and let x, x G A. Assume that T^(x) = T t E (x). Then 

(5.1) c e (x,T%x))<c /(x,z) + cl' 1 (z,T e (x)), 

and similarly, 

c £ (x,T s (x)) < c°/(x,z)+c t £ ' 1 (z,T e (x)) 
where z = T t E (x) = T t £ (x). It follows that 
c £ (x,T s (x))+c £ (x,T e (S:)) 

<c £ {x,T e {£))+c £ (x,T e (x)) 

< c £ '\x,T^x))+cl'\T^x),T^x))+c £ '\x,Tai))+cl'\T^i),T%i)) 
= c £ (x,T E (x))+c £ (x,T £ (x)). 

Hence equality has to hold in all these inequalities. In particular equality holds 
in (|5.1|) . It follows that the curve obtained by concatenation of the minimal curve 
s G [0, i] i — v e s (S(x,T E (x))) between x and z with the minimal curve s G [t, 1] M> 
e s (S(x, T e (x))) between z and T e (x) is a minimal curve. Since this curve coincide 
with the minimal curve a : s G [0, 1] i— > e s (S(x,T e (x))) on the non trivial interval 
[0, t] and HP is non-branching (see Proposition ^. 3|) . we get that it coincides with a 
on the whole interval [0, 1]. Similarly, it coincides with the minimal curve a : s G 
[0, 1] i — y e s (S(x, T E (x))) on the whole interval [0, 1]. Hence a — a and in particular 
x = o-(0) = a(0) = x. ' □ 

We turn now to the main estimate that will lead to Proposition 15. 71 
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Proposition 5.6. Let ~p e V{W l ), JI < £ 2n+1 ; JI = pdC 2n+1 , and T : BP -)• HP 

be a JI - measurable map such that T^JJ is finitely atomic. Let t 6 (0, 1) and set 
JI t := T t $JI. Assume that there exists a JI - measurable set A such that ~p(A) = 1 
and T t [a is infective. Then ~p t <C C 2n+1 , JI t = pt dL 2n+1 with 

P* ^ (1 „ t )2r t +3 ^ or * Lr t (A) C + -a.e. 

Arguments for the proof of this proposition can be found in |15[ Section 3] even- 
though not explicitly stated in the same way in that paper. They rely on the 
following estimate: 

C 2n +\E) < (1 J )2n+3 C 2 ^{{e t oS){E,y)) 

for any y S HP and E C HP which is proved in |18[ Section 2] and which roughly 
means that (HP, d, C 2n+1 ) satisfies a so-called Measure Contraction Property. We 
detail the proof below for the reader's convenience. 

Proof. Let {yi}^ =1 denote the atoms of T$p. Set A, = T _1 ({yj}) C\A and A = U^A*. 
The sets Aj are mutually disjointed and ~p{A) — 1 by hypothesis. For any x G A^, 
T t (x) = (e t o S)(x, yi), hence 

£ 2n+1 (E) < (1 _* )aw+3 £ 2 " +1 (r t (^)) 

for any E C Aj. Next if E C A, writing E = Ui(E D Aj) where the sets Aj are 
mutually disjointed and remembering that Tt is injective on A C A by hypothesis, 
one gets 

C 2n+1 (E)< {i _\ )2n+3 C 2 ^(T t (E)). 
It follows that for any Fcl", 

c 2n+1 (T t - 1 (F)nA) < (1 _ f 1 )2n+3 £ 2 "+ 1 (Fnr t (i)). 

Assume that F C HP is such that £ 2n+1 (F) = 0. We get jC 2n+1 {T t r 1 (F)nA) = 
from the previous inequality. On the other hand ~p t {F) — Jl(Tf (F)) = Jl(T t T 1 (F) n 
A) hence /I t (F) = since p « £ 2n+1 and it follows that JI t <C £ 2n+1 . 

Next, to prove the estimate on the density of ~p t with respect to £ 2n+1 , we note 
that the inequality above implies that 

for any non negative measurable map g : HP — > [0, +oo]. Let h : i" — >• [0, +oo] be 
a non negative measurable map and set 

g(x) = l Tt(A) (x) h(x) piT^l^^ix)). 

Then 

-2n+l/ \ ^ \ / ./ \ .(rp-X\ . ^'iU/ , 2»>+l/ 



S^Or))^ 2 ^) < - / /i(a;) p(Tf l Tt (A) ( x )) d£ (x) 

U - IJ JTt(A) 

other hand 

( g(T t )dC 2n+1 = [ h(T t )pd£ 2n+1 = f h(T t )dJl= f hdji t , 



hence 



h(x)dp t (x)< / _ h(x)p(T t - 1 [ Tt{A) (x))dC 2n+1 (x). 

{ L l ) JT t (A) 

Remembering that A C A, this concludes the proof. □ 
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Finally, combining Theorem 15.41 Propositions 15.51 and 15.61 we get the following 
proposition which gives the absolute continuity of the interpolations together with 
an L°°-estimate on their density. Note that if -f e is the transport plan solution 
to Kantorovich transport problem (I2.1[) between ~p and V with cost c e and T e the 
optimal transport map solution to the transport problem (|2.3j) between ~p and v 
with cost c^, which hence induces j e , then (et ° S)^ E = T^Ji. 

Proposition 5.7. Let ~pZ, V £ "P C (1HP) be fixed. Assume that ~p <C £ 2n+1 with 
~p = pdC 2n+1 and V is finitely atomic. Let j e be the transport plan solution to 
Kantorovich transport problem (|2.1[) between Jl and V with cost c e . Then for any 
t £ [0, 1), the interpolation (etoS)^ e is absolutely continuous with respect to C 2n+1 , 
(e t o 5)j7 e = p\ dC 2n+1 , and one has 



1 



\fth-< „ ^ n+3 \\p\\ 



(1-t) 



L° 



6. Properties of measures 7 £ 7>(HP x HP) with (7ri)j7 < £ 2n+1 

This section is independent of the transport problem. We state some properties 
of measures in 7-* (HP x H n ) with first marginal absolutely continuous with respect 
to C 2n+1 . These properties are essential steps in the strategy adopted here to solve 
Monge's transport problem. They are the exact counterpart in the framework of 
(H ra , d, C 2n+1 ) of similar properties proved in in IP. These properties hold 
actually true in more general settings, for instance in any separable doubling metric 
measure space. 

We first recall some facts about Lebesgue points of Borel functions and density of 
absolutely continuous measures. Since the measure £ 2n + 1 is a doubling measure on 
(H",d), see (pT2]l, if p : H" -> [0, +00] is a £ 2 " +1 -locally summable Borel function 
then for £ 2n+1 -a.e. x £ HP, one has 

(6-1) lim, ,^ * / \p(y)- P (x)\dC 2n+1 (y)=0, 



* &»+HB(x,r)) J B(x<r) 

see e.g. [T7]. A point x £ H™ where (|6.ip holds is called a Lebesgue point of p and 
we denote by Leb p the set of all Lebesgue points of p. 

In rest of this paper, and especially in the lemma to follow, it will be technically 
convenient to consider the density p of an absolutely continuous measure p <C C 2n+1 
as a £, 2n+l -summable Borel function, i.e., a function well-defined everywhere, so 
that one can speak about its Lebesgue points and its value at any arbitrary point 
without any ambiguity. If p, £ V(M. n ), we set 

(6.2) p(a):=]j ™ p ______ 

This map p : HP -> [0, +00] is a Borel map and if p, < C 2n+1 then p = pdC 2n+1 . 
By a slight abuse of terminology, when speaking about the density of an absolutely 
continuous measure p £ 'P(HP) with respect to C 2n+1 , we will thus always refer in 
the following to the Borel function p defined above. 

The next lemma will be an essential ingredient in the proof of Lemma 17.11 

Lemma 6.1. Let 7 £ P(M n x HP) be such that (tti) S 7 < C 2n+1 . Then 7 is 
concentrated on a set T such that, for all (x,y) £ T and all r > 0, there exist 
y' £ HP and r' > such that 

(i) y£B(y',r') d£ B(y,r), 

(ii) x £ Leb p and p(x) < +00, 
(iii) x £ Leb p' and p'(x) > 0, 

where p denotes the density 0/ (tti )tt7 and p' the density of {'Ki)^y\JW l x B(y',r')) 
with respect to C 2n+1 . 
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Proof. Let [y m )m>i be a dense sequence in H". For each m, k £ N*, set j mt k := 
7L(H™ x B(y m ,r k )) where r k := 1/k. Let p m ,k denote the density of (iri)$j mtk with 
respect to C 2n+1 . Set A m , k := H" \ (Lebp n Lebp m . fc n {p < +00}). We have 
C 2n+1 {A mM ) = 0. Since (tti)j7 < C 2n +\ it follows that 7 (A m , fc x B(y m ,r k )) < 
(7r 1 ) tt7 (A„ i!fc ) = 0. Next 

l{{Pm,k = 0} x B(y m , r k )) = (nx)^ mi k({Pm,k = 0}) = 0. 

It follows that 7(-D m ,fe) = for all to, k £ N* where 

D m ,fc := [H™ \ (Leb p n Leb p m<k n {p < +00} n {p m<k > 0})] x B(y m , r k ) 

hence 7(U mj tI> m i) = and 7 is concentrated on H™ \ Ll m ,kDm,k- Then the conclu- 
sion follows noting that for each (x, y) £ H™ x H™ and r > 0, one can find m, fc £ N* 
such that y € B(y m ,r k ) CC B(y,r). □ 

We say that cc £ E is a Lebesgue point of a Borel set E 1 if x £ LebflE, i.e., if 
x E E and 

£ 2w+1 (£ng(a;,r)) _ 
rTo £ 2 «+i(B(x,r)) ~ ' 

and we denote by Lebi? := Leb 11b the set of all Lebesgue points of E. Note that 
C 2n+1 (E\LebE) = 0. 

The next lemma together with Lemma 17.11 and Lemma 14.31 is one of the key 
ingredients of the proof of Theorem 18 . 1 1 and eventually of the existence of a solution 
to Monge's transport problem. It can be recovered as a consequence of Lemma 16. II 
However, for sake of clarity, we state and prove it independently. 

Lemma 6.2. Let 7 £ V(W l x H") be such that (t^)^ < C 2n+1 . Assume that 7 is 
concentrated on a a-compact set F. For y £ HP and r > 0, set 

r- 1 (5(y,r))= 7 ri(rn(H" xB(y,r))). 

Then T (B(y,r)) is a Borel set and 7 is concentrated on a set T' C F such that 
for all (x,y) £ T' and all r > 0, x £ LebF~ 1 (S(y,r)). 

Proof. Since T is cr-compact, T^ 1 (B(y, r)) is also a-compact hence a Borel set. Set 
A := {{x,y) £ T; x £ LebT~ 1 (B(y, r)) for some r > 0} and let us show that j(A) = 
0. For each k £ N*, consider a countable covering of H™ by balls (B(yf ,r k )) i>i of 
radius r k := l/(2k). If (x,y) £ T and x ^ Lebr _1 (B(y, r)) then for any k > 1/r 
and y\ such that d(yf,y) < r k , one has x £ T^ 1 (B(y^, r k )) \ LebT^ 1 (B(yf ,r k )). It 
follows that 

fe>i i>i 

The set on the right-hand side has £ 2n+1 -measure 0. Since (tti)$j <C £ 2n+1 , it 
follows that 7(A) < (7ri)j 7 (7ri(A)) = 0. □ 

7. Lower density of the transport set 

We consider optimal transport plans in IIi(/i, v) that are obtained as weak limit 
of solutions to the variational approximations introduced in Section [5] We prove 
that if 7 is such a transport plan then it is concentrated on a set T whose related 
transport set has positive lower density at each point x £ 7Ti(r) for some suitable 
notion of lower density. As already mentioned, this is one of the main ingredient 
in the proof of Theorem 18.11 Following the notion of transport set introduced in 
e.g. [1], we define in our geometrical context the transport set related to a set 
rci"x H" as 



T(T) := {( et o S)(x,y); (x,y) £ T, t £ (0, 1)}. 
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Recall that S is a measurable selection of minimal curves and that (e t o S)(x,y) 
denotes the point at distance td(x,y) from x on the selected minimal curve S(x,y) 
between x and y, see Subsection 13.31 



Lemma 7.1. Let 7 £ n(/i, v) obtained as a weak limit of solutions to (P Ek ) for 
some sequence £k converging to 0. Then 7 is concentrated on a set T such that for 
all (x, y) € r such that x ^ y and all r > 0, we have 

. . C 2n+1 (T(T n [B(x, |) x B(y, r)\) D B(x, 5)) 

hmmf „ , f, — ; — > 0. 

sio C 2n+1 (B(x,6)) 

The proof below follows the line of the proof of the similar property in [12] . In 
our context it requires however some technical refinement. 

Proof. We consider the set T obtained by Lemma 16.11 (x,y) £ T with x 7^ y and 
r > 0. Then let y' £ H™ and r' > be given by Lemma \Q. II so that Lemma TO-lf i). 
(ii) and (iii) hold. Using the same notations as in this lemma, we set 

G := {z £ H n ; -p'(x) < p'{z) and p(z) < 2p(x)}. 

Then G is a Borel set. We have < p'{x) < p(x) (remember the convention about 
densities of absolutely continuous measure, see (|6.2[1 ). Since x £ LebpflLebp', see 
Lemma IrTTT ii) and (iii), it follows that x £ LebG. 
Fix 5 > such that 6 < d(x, y) + r and 

(7.1) ^£. 2n+1 (B(x,s)) <C 2n+1 {GnB(x,s)) 

for all s £ (0, 5) and fix t > such that 4t(d(x, y) + r) < S. 

We set Gj:=Gn B(x, f ), A 5 := Gs X 5(y', r') and 75 := 7^5. We shall prove 
that 



(7.2) ^ £ -+i (i? ( X; _ )) < ( ei o5) nj (B(,,i)) 
and 

(7.3) (et o SJhtj^Ci, 5)) < 2 2 "+ 4 p(^)/: 2 " +1 (T(r n [5(1, ~) x 5(2/, r)]) n B(x, 8)). 
Then dZHJ) and (O will yield 

2- (2 " +6) ^/: 2n+1 (5( a ;, ^)) < C 2n+1 (T(Tn[B(x, x 5(y, r)]) n B(x, 8)) 

for any 6 > small enough which completes the proof. 

To prove (|7.2[) . we note that (7ri)j75 <C £2n+i w ith density bounded below by 
\p'{x) £ 2n+1 -a.e. on Gs. Together with flTTTj) . it follows that 

^^(S^l))^ (^75(^1)). 

Next, by choice of S and t, we have (e* o S){z,w) £ B(x,S) for all z £ 5(x, |) and 
w £ 5(y, r), hence 

B(x, 5 -) x 5(y',r') C B(x, ~) x 5(y,r) C (e t o S)" 1 (5(z, <5)) 
and it follows that 

(7ri) |7 ,(B(a; 1 1)) = 7a (B(a;, 5) x 5(y', /)) < (e t o 5) f7a (5(x, 5)) 



and this completes the proof of (|7.2[) . 

We prove now (|7.3p . By hypothesis, 7 is a weak limit of solutions 7^ to (P Efc ) 
for some sequence Ek converging to 0. For each fixed k £ N, we apply Lemma 15.3 
with U = Gs X HP 1 and Proposition 15 . 71 with /Z = (7ri)jt(7fc [U) an d v = (w2)$(jk \U). 
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Taking into account the fact that (TTi)^(jk \U) = (i[Gs, we get that (et° S)$('jk [Gs x 
HP) <C C 2n+1 with density in L°° and whose L°°-norm is bounded by 

(7-4) n \— \\pIg s \\l~ < 2 2n+i p(x). 



(1 - t) 2n + 3 

Next we check that (e t o S)^^ [G$ x HP) converges weakly to (e t o S 1 )}) (7 xi"). 
First it follows from Lemma 17.21 (to be proved below) that jk [Gs x I" converges 
weakly to ^[Gs x HP. Then, noting that 7 and each jk are concentrated on ft and 
that et o S is continuous on Q, the claim follows from Lemma 17.31 (to be proved 
below) applied with 7 = j[Gg x HP, 7 y k = 7^ [Gs x HP, 5 = Q and f — tp o e t o S 
where y> G C(,(H n ). The fact that 7 is concentrated on f2 follows from Lemma [4.11 
To check that jk is concentrated on f2, denote by {yf }i the finite set of the atoms 
of (7T2)j)7fc. We have that 7^ is concentrated on H" x {yf},. On the other hand 
-/k(L y k x {yf}) < ~fk(L y k x H") = fi(L y k) = since # < £ 2 ™ +1 . It follows that 7 fe 

is concentrated on U,[(HI n \ L y k) x {yf}] C fl Then, taking into account (|7.4[> . we 
get 

p d(et o 5) B ( 7 LG 5 x HP)| < 2 2 "+V(.x) |M| l i 

for every <p G C h (H n ). It follows that (e t o S) t {~f[G s x HP) is in (L 1 )' with density 
in L°° and whose L°°-norm is bounded by 2 2n+4 p(x). Since (e t o 5)07,5 < (e t o 
S)jl(7|_G,s x HP), the same holds true for (e t o S*)^- Finally we note that 75 being 
concentrated on T n [B(x, §) x B(y',r')} C T n [S(x, f) x B(y,r)], the measure 
(e t o5)tt7i is concentrated on T(m |) x B(y', r')]) C T(m §) x B(y, r)]). 
All together we get 

(e t o S) as (B(x, 5)) = (e t o S) B75 (T(r n [B(a;, -) x B(y, r)]) n B(x, <5)) 

< 2 2 " +4 P W/: 2 " +1 (T(rn[B( I , £) x %,r)])nB(^)) 

which proves ()7.3|) . □ 

Lemma 7.2. Lei 1 fc a separable and locally compact Hausdorff metric space in 
which every open set is a-compact. Let (jk)k be a sequence in V(X x X) which 
converges weakly to some 7 G V(X x X) and such that (7Ti)j7fc = (7Ti)j7 for every 
k G N. Then for any Borel set G C X, the sequence (7/. [G x X)k converges weakly 
to~f[GxX. 

Proof. We have to prove that for any <p G Cb(X), 



lim / l G (x)ip(x,y)d'jk(x,y) = / H G (x)<p(x, y) d-y(x, y). 

k^ + ooJ XxX JXxX 

It follows from Lusin's Theorem that for any e > there exists a closed set F e 
such that IgLfv ^ s continuous and (^^^(X \ F e ) < e. As a consequence, for every 
e > 0, the restriction of (x,y) 1— > Ho(x)ip(x,y) to F e x X is continuous and 

limsu P7fc ((A \ F e ) x X) = (-Ki)a(X \ F e ) < e. 

k— >-\-oc 

Then since (x,y) 1— > |lG(ar)<^(a;, y)| is bounded and hence uniformly integrable with 
respect to (jk)k, the claim follows from [21 Proposition 5.1.10]. □ 

Lemma 7.3. Let X be a separable metric space and (7/Jfc be a sequence in V(X) 
which converges weakly to some 7 G V(X). Let f : X — » K be a measurable and 
bounded function which is continuous in B for some Borel set B C X such that 
7 fe (A \ B) = for every k G N and j(X \ B) = 0, then 



k- 



lim / fd'fk = / fd-y 



x Jx 
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Proof. Let / and / be respectively the lower and upper semicontinuous envelope 
of /. We have / = / = / on B and hence 7-a.e. and 7fc-a.e. for every k 6 N. It 
follows that 

/ d-y = / / d-y < lim inf / fdry k = lim inf / / d-y k 



X JX ^ — Jx k—>oc jj£ 



< lim sup / fd-y k = lim sup / fdj k < / fd-y = f dj 

k^roo JX fc->oo Jx JX JX 

which proves the claim. □ 

8. Solution to Monge's problem 

We prove that optimal transport plans in III (/i, v) that are obtained as weak limit 
of solutions of the variational approximations introduced in Section [5] are induced 
by a transport, hence giving a solution to Monge's transport problem as stated in 
Theorem ll.il Note that due to the fact that II is relatively compact in V{R n x H n ), 
such optimal transport plans do exist. 

Theorem 8.1. Let e k be a sequence converging to and"f ek a sequence of solutions 
to (P Ek ) which is weakly converging to some 7 G V(W n xi"). Then 7 is concentrated 
on a [i-measurable graph and hence induced by a transport. 



Proof. First we know from Lemma 15.21 that 7 G Il2(/i,^). From the previous sec- 
tions and using inner regularity of Borel probability measures, one can then find 
cr-compact sets T and T' such that f C T C !1 and the conclusions of Lemma 14. 3[ 
Lemma \Q. 2 1 and Lemma \7~J\ hold. We prove here that for any x G 7i"i(r') there is a 
unique y G H n such that (x,y) G T' . 

By contradiction, assume that one can find xq G 7i"i(r') and (xo,yo) G T, 
(^0 , 2/i ) G r with j/o 7^ 2/i • Without loss of generality one can assume that 
d(xo,yo) < d(xo,yi) and xq ^ y±. Then, by Lemma [6.21 and Lemma f7.lt for all 
r > and for all S > small enough, one can find x' G B(xo,S) ("1 T^ 1 (B(y , r)) Pi 
T(r ("1 [B(xq, f) x B(yi,r)]). It follows that one can find y' G B(y ,r) such that 
(x', y') G T and (x, y) G T n (B(x a , |) x B(yi, r)) such that x ^ y, x' ^ x and x' lie 
on the minimal curve between a; and y. Then it follows from Lemma 14.31 that x, x' , 
y and y' lie on the same minimal curve ordered in that way. 

Assume first that d(xo, yo) < d{xo, y\). We know from Lemma 14.31 that d(x, y) < 
d(x, y'). On the other hand, we have 

d(x, y') < d(x, x ) + d(x a ,y ) + d(y , y') 
5 

S 

< d(x , x) + d(x, y) + d(y, yi) + d(x Q ,y ) - d(x , yi) + -+r 

< d(x, y) + d(x ,yo) ~ d(x , y 1 ) + S + 2r. 

It follows that d(x,y') < d(x,y) provided we take r > and S > small enough 
which gives a contradiction. If d(xo,yo) = d(xo,yi), we have 

d{x ,yi) < d(x ,x) + d(x, y) + d{y, y x ) 

= d(x ,x) + d(x, y') - d(y\ y) + d(y, yi) 



< d(x ,yo) 
= d{x , 2/1) + d(x , yo) - d(x , yi) 



<d(x,y')-d(y',y) 
d(x, y') < d(x, x ) + d(x , y ) + d(y , y') < d(x ,y 



S 

2 +r ' 

6 
2 



MONGE'S TRANSPORT PROBLEM IN THE HEISENBERG GROUP 



21 



d{y',y) > d(y , yi ) - d(y ,y') - d( yi ,y) > d(y ,yi) - 2r, 

hence, 

d{x ,yi) < d(x ,yo) - d(y ,yi) + 4r + 5. 

It follows that d(x ,yi) < d(x ,y ) provided we take r > and 5 > small enough 
which gives also a contradiction. □ 

9. Extension to more general metric measure spaces 

First we note that a major part of intermediate steps in the strategy adopted in 
the present paper can be naturally extended to Polish and non-branching geodesic 
spaces equipped with a reference measure for which the Lebesgue's differentiation 
theorem holds. 

Next our choice of approximating costs c e in the approximation procedure is 
not the only possible one. This choice could in particular be adapted to fit other 
contexts (for instance concerning the relevant properties of solutions to the transport 
problem associated to the approximating cost). 

Finally the Measure Contraction Property is here technically very convenient. 
We note however that this property is unnecessarily too strong for what is actually 
needed in the proof about the lower density of the transport set. Much local and 
weaker versions about the behavior of the measure of sets transported along minimal 
curves are indeed sufficient as clearly shows up from the proof. 

This approach can in particular be adapted to give an alternative proof of the ex- 
istence of solutions to Monge's transport problem in the Riemannian setting without 
using Sudakov's type arguments. 

For the reasons listed above it is furthermore very likely that the present strategy 
could be adapted and extended to other geodesic metric spaces. 
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